Online Learning from Experts: Minimax Regret
Abstract
In the last three lectures we have been discussing online learning algorithms in which, for t = 1, ..., T, we receive an instance x and then its label y. In the last lecture, specifically, we talked about online learning from experts and online prediction. We saw several algorithms: the Halving algorithm, the Weighted Majority (WM) algorithm, and finally the Weighted Majority Continuous (WMC) algorithm, along with bounds on the cumulative loss each incurs. Today we will focus on online prediction. The setting for the WMC algorithm is as follows: we have N experts, each of whom predicts the outcome (label) in [0,1], and we combine these predictions by taking their weighted average. We then receive the true label, incur some loss (absolute loss in our setting), and update the expert weights based on that loss. The intuition is that the higher the loss incurred by an expert, the more drastically we reduce its weight. For the WMC algorithm, we proved that:
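A standard form of this bound (in notation assumed here, not quoted from the notes, for the multiplicative update w_i ← w_i · β^{|x_i − y|} with parameter β in (0,1)) is

    L_WMC ≤ (ln N + L_i · ln(1/β)) / (1 − β)   for every expert i,

where L_WMC is the cumulative absolute loss of the algorithm and L_i is the cumulative absolute loss of expert i; the exact statement in the lecture may differ in notation.

Below is a minimal Python sketch of the WMC loop described above; the function name, the default β, and the array layout are illustrative assumptions rather than details taken from the notes.

    import numpy as np

    def wmc_loss(expert_preds, labels, beta=0.5):
        """Run the continuous Weighted Majority (WMC) update and return its cumulative absolute loss.

        expert_preds: (T, N) array, expert i's prediction in [0, 1] at round t.
        labels:       (T,) array of true outcomes in [0, 1].
        beta:         multiplicative update parameter in (0, 1).
        """
        T, N = expert_preds.shape
        w = np.ones(N)                                         # start with equal weight on all N experts
        total_loss = 0.0
        for t in range(T):
            y_hat = np.dot(w, expert_preds[t]) / w.sum()       # weighted-average prediction
            total_loss += abs(y_hat - labels[t])               # absolute loss against the true label
            # the larger an expert's loss this round, the more its weight shrinks
            w *= beta ** np.abs(expert_preds[t] - labels[t])
        return total_loss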
Similar Resources
Online Learning with Composite Loss Functions
We study a new class of online learning problems where each of the online algorithm’s actions is assigned an adversarial value, and the loss of the algorithm at each step is a known and deterministic function of the values assigned to its recent actions. This class includes problems where the algorithm’s loss is the minimum over the recent adversarial values, the maximum over the recent values,...
Online Nonparametric Regression with General Loss Functions
This paper establishes minimax rates for online regression with arbitrary classes of functions and general losses. We show that below a certain threshold for the complexity of the function class, the minimax rates depend on both the curvature of the loss function and the sequential complexities of the class. Above this threshold, the curvature of the loss does not affect the rates. Furthermore...
Robustness in portfolio optimization based on minimax regret approach
Portfolio optimization is one of the most important issues for effective and economic investment. There is plenty of research in the literature addressing this issue. Most of this research attempts to make Markowitz's primary portfolio selection model more realistic or seeks to solve the model to obtain fairly optimal portfolios. An efficient frontier in the ...
Abstracts - Workshop on Algorithmic Challenges in Machine Learning
The difficulty of an online learning problem is typically measured by its minimax regret. If the minimax regret grows sublinearly with the number of online rounds (denoted by T), we say that the problem is learnable. Until recently, we recognized only two classes of online learning problems: problems whose minimax regret grows at a slow rate of O(\sqrt(T)), and unlearnable problems with linear ...
Optimal Strategies and Minimax Lower Bounds for Online Convex Games
A number of learning problems can be cast as an Online Convex Game: on each round, a learner makes a prediction x from a convex set, the environment plays a loss function f, and the learner's long-term goal is to minimize regret. Algorithms with provably low regret have been proposed by Zinkevich, when f is assumed to be convex, and by Hazan et al., when f is assumed to be strongly convex. ...